Data Analysis

• Size: 1.2GB
• Shape: 21,00,000 Rows & 147 Columns
• Data Source: Kaggle

Data Visualization

• Used Tableau to visualize lending data across United States.
• Used Sweetviz Library to get the distribution of each feature.

Default Prediction

• Get your Interest Rate, Grade, Sub Grade based on the FICO Score provided
• Get your loan approval chances by providing few necessary informations.

Are you looking for a Individual Loan or a Joint Loan?

Visualization

Get state-wise lending industry information
of United States:

  • Select the state about which you want to know the following information
  • Loan to Income Ratio vs Grades graph shows the variation of Fully Paid and Defaulted loans based on the two parameters. It was observed that usually the ratio is high for lower grade loans and higher for Defaulted Loans across grades.
  • Know the total amount of loans issued over the years. Both Fully Paid and Defaulted loans peaked near 2015.
  • From the box plot, observe how the interest rates offered for loans vary for different grades. Usually, lower the grade, higher would be the interest rate offered by banks to cover their risk.
  • The graph of average interest rate vs loan amount shows how the defaulted loans always had a higher interest rate across the range of loan amount.
  • Highest purposes of the disbursed loans for a particular US state chosen

Lending Club Data Analysis (Tableau Visualization)

Few Insights.

Using K nearest neighbours for prediction of Interest Rate

System tries to predict the Sub-Grade based on the FICO score provided.

  • One would expect the relation to be linear between FICO score and sub grade but that is not exactly the case.
  • From the sub-grade predicted we get the Interest Rate for the applicant.

To know more about the data analysis look at this project on Github